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(54) Modular storage method and apparatus for use with software applications 



(57) A method and system is provided that process- 
es the content of a document separately from informa- 
tion used to describe the structure of the content in the 
document. Content can be text, alphanumeric data, or 
any other information contained in a document. Initially, 
the method provides a content entity capable of storing 
and manipulating the content stored in a storage area. 
The manipulation of content in the storage area includes 
inserting content in the content entity identified by a start 
position and a stop position, removing content from the 
content entity, and retrieving content from the storage 
area. A separate document entity is provided that uses 
element entities to represent the structure of the text 
without the content. The document entity is used to re- 
trieve information from the element entities and deter- 
mine the location of content in the content entity. The 
information retrieved from the element entities is used 
to manipulate the content stored in the storage area as- 
sociated with the content entity. 
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Description 
Related Applications 

[0001] The following identified U.S. patent apptica- 5 
tions are relied upon and are incorporated by reference 
in their entirety in this application. 
[0002] Provisional U.S. Patent Application No. 
60/075,975, entitled "METHOD AND APPARATUS FOR 
USER INTERFACE WITH PLUGGABLE LOOK AND 
FEEL, n filed on February 24, 1998. 
[0003] U.S. Patent Application No. 
, entitled "METHOD AND APPARA- 
TUS FOR AN EXTENSIBLE EDITOR FOR USE WITH 
A GRAPHICAL USER INTERFACE,' 1 bearing attorney 
docket no. 06502.0126-00000, filed on the same date 
herewith. 

[0004] U.S. Patent Application No. 

, entitled "METHOD AND , entitled 

■METHOD AND APPARATUS FOR GENERATING 
TEXT COMPONENTS IN A GRAPHICAL USER IN- 
TERFACE," bearing attorney docket no. 
06502.0127-00000, filed on the same date herewith. 
[0005] U.S. Patent Application No. 08/885,61 4/P2390 
filed June 30, 1997, assigned to the assignee of the 
present invention and entitled "METHOD AND APPA- 
RATUS FOR GENERATING A GRAPHICAL USER IN- 
TERFACE," by Timothy N. Prinzing. 

Technical Field 

[0006] This invention generally relates to software ap- 
plications and, more particularly, to a modular storage 
method and apparatus for use with software applica- 
tions. 

Background Art 

[0007] Increasingly, software applications must run 
on a variety of data processing systems ranging from 
small handheld devices to powerful workstations. In 
most cases the software applications must be modified 
to run on each computer system. Some modifications 
may only require recompiling the software for a target 
computer system. Other modifications need to take into 
account physical characteristics of the computer system 
such as limited disk storage, processor power, or mem- 
ory capacity. For example, a small handheld device with 
limited disk storage may have difficulty storing the con- 
tents of large documents used by a particular software 
application. To address this limitation, the storage meth- 
ods used in the software application must be modified 
to accommodate storing the content of large documents 
on devices with small storage capacities. 
[0008] In conventional software applications, howev- 
er, it is often not possible to replace a method of storing 
the contents of a document without rewriting a signifi- 
cant portion of the software application. These conven- 



tional software applications store content and organize 
the structure of the content using an integrated software 
module. That is, a software method that processes con- 
tent such as text also often processes the structural in- 
formation that describes how the text is organized. 
Structural information, for example, includes formatting 
codes and instructions that organize a document into 
paragraphs, sentences, words, and characters. This in- 
tegration of content storage and structural organization 
of a document makes it difficult to replace content stor- 
age routines without affecting other portions of the soft- 
ware application. 

[0009] As a result, there is a tension between devel- 
oping new methods for storing content in an application 
and reusing code. Instead of having several methods 
for storing content, the software developers will create 
a new version of software for each computer system. 
These additional versions increase the development 
costs, increase the costs of customer service, and in- 
crease software maintenance. 
[001 0] Users with different computer systems may al- 
so have problems using a software application available 
in different versions. In such instances, users may have 
to learn to operate different versions of the same soft- 
ware for each of their different computer systems. Cus- 
tomers may also have difficulty moving data and docu- 
ments between computer systems running different ver- 
sions of the same software application. For example, 
content stored using different versions of the same soft- 
ware may not be compatible. 

[0011] To address these limitations, it would be desir- 
able to replace the content storage methods of an ap- 
plication without modifying other portions of the software 
application. This would facilitate replacing a method of 
storing content in a software application without chang- 
ing the method used to structure the content. This would 
promote increased software reuse and decrease soft- 
ware development costs. 

Disclosure of the Invention 

[001 2] In accordance with methods and systems con- 
sistent with the present invention, a method is provided 
that processes storing content of a document separately 
from processing the information used to describe the 
structure of the content. Content can be text, alphanu- 
meric data, graphical data or any other information con- 
tained in a document. A layout operation uses informa- 
tion reflecting the structure of the content of a document 
such as paragraphs, sentences, and characters. 
[0013] Such a method and system provide a content 
entity capable of storing and manipulating content 
stored in a storage area. The manipulation of content in 
the storage area includes inserting content in the con- 
tent entity, removing content from the content entity, and 
retrieving content from the content entity. A separate 
document entity is provided that uses element entities 
to represent the structure of the text without the content 
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of the text. The document entity retrieves information 
from the element entities to determine the location of 
content in the content entity. The information retrieved 
from the element entities is used to manipulate the con- 
tent stored in the storage area associated with the con- 
tent entity. Entities can be implemented in procedural 
software programming languages or object oriented 
programming languages. 

[0014] In accordance with another aspect of the 
present invention, a method is provided capable of re- 
placing a content entity used to store content in a doc- 
ument. A software application loaded into a data 
processing system has a default content entity for stor- 
ing content in a document. The method determines if 
the default content entity or a new content entity should 
be used based on the storage capacities of the data 
processing system. This determination is made by com- 
paring the storage capacity of the system with the stor- 
age characteristics provided by the default content en- 
tity and other content entities. Based on a result of this 
determination, the software application may replace the 
default content entity with a new content entity Upon 
execution, the software application uses a document 
entity to provide the structure of content in the document 
and the new content entity to provide the content asso- 
ciated with the document. 

Brief Description of the Drawings 

[0015] The accompanying drawings, which are incor- 
porated in and constitute a part of this specification, il- 
lustrate an implementation of the invention and, togeth- 
er with the description, serve to explain the principles of 
the invention. 
[0016] In the drawings: 

FIG. 1 depicts a block diagram of a network in which 
methods and systems consistent with the present 
invention may be implemented; 
FIG. 2 depicts a block diagram of a data processing 
system in which methods and systems consistent 
with the present invention may be implemented; 
FIG. 3 depicts a block diagram of software entities 
implemented consistent with the present invention 
used to separate the process of storing content from 
the process of structuring content associated with 
structured text; 

FIG. 4 depicts a flow chart of the operations used 
to replace a content entity for storing content con- 
sistent with one implementation of the present in- 
vention; and * 

FIG. 5 depicts a flow chart of the operations for us- 
ing software entities consistent with one implemen- 
tation of the present invention to process the stor- 
age of content separately from the structure of 
content . 



Best Mode for Carrying Out the Invention 
Overview 

5 [0017] Implementations consistent with the present 
invention facilitate processing the storage of content of 
a document separately from processing associated with 
information reflecting a structure of content. A content 
entity facilitates the storage of content while a document 
io entity facilitates representing the structure of the docu- 
ment. The document can be a text document, a spread- 
sheet, or any other data set used by an application. The 
content entity stores content in a storage area and pro- 
vides commands to manipulate the content. The docu- 
is ment entity uses element entities that represent the 
structure of the document and reference the content 
stored by the content entity. In a text editor, for example, 
element entities can represent combinations of para- 
graphs, sentences, and characters. 
[0018] By replacing the storage methods in a software 
application, data processing systems with different stor- 
age capacities can run the same application and access 
the same information. For example, a software applica- 
tion running on a mainframe may be able to load an ap- 
plication and all the data associated with the application 
on a single disk drive. The mainframe would use a con- 
tent entity that loads the content on a single disk drive 
and then use a separate document entity to determine 
the structure of the content. On a palm top computer 
system with limited storage, the same document entity 
would determine the structure of the content but a dif- 
ferent content entity would be used to store the content. 
Rather than store all the information on a single disk 
drive, the content entity used by the palm top computer 
system may compress the information or may access 
the information as needed in small increments over a 
network such as the Internet. 

System Architecture 

[0019] FIG. 1 is a block diagram of an exemplary net- 
work in which methods and systems consistent with the 
present invention may be implemented. A client system 
102 and a server system 106 are connected together 
through network 104. A software application can exe- 
cute on client system 102 and can access the contents 
of a document on client system 102 using one type of 
storage method. By replacing the storage method, the 
same software application can continue to execute on 
client system 102 but access the contents of the docu- 
ment on server system 106. In other implementations, 
storage methods having compression and other fea- 
tures can also be used to store the contents of a docu- 
ment. 

[0020] FIG. 2 is block diagram of an exemplary data 
processing system with which methods and systems 
consistent with the present invention may be implement- 
ed. Computer system 102 includes a memory 202, a 



25 



30 



35 



40 



45 



50 



3 



5 



EP 0 938 050 A2 



6 



processor 204, a network interface 206, an input-output 
device 208, a keyboard 210, a mouse 21 2, a secondary 
storage 214, and a bus 216 which provides connectivity 
and communication among these subsystems. Network 
interface 206 can be connected to a network such as a 
Local Area Network (LAN), a Wide Area Network 
(WAN), or the Internet. Bus 216 may use a standard bus 
protocol. 

[0021] Memory 202 includes a software application 
218, a development kit (DK) 219, and an operating sys- 
tem 228. Software application 218 may be a business 
software application such as a text editor or spread- 
sheet, a software development tool, or other software 
application that uses a document with structured infor- 
mation. DK 21 9 provides libraries, subroutines, classes, 
drivers, and other software modules accessible to soft- 
ware application 218 using application programming in- 
terfaces (API). 

[0022] For example, DK 21 9 may include a virtual ma- 
chine 226 and classes used in an object-oriented pro- 
gramming language such as the Java™ programming 
language. Virtual machine 226 facilitates platform-inde- 
pendent data processing by simulating the operation of 
an abstract computing machine. Virtual machine 226 re- 
ceives instructions for this abstract computing machine 
in the form of bytecodes. These bytecodes are interpret- 
ed and dynamically converted into a form for execution, 
such as object code, on processor 204. Essentially, 
bytecodes can be executed on any hardware platform 
having virtual machine 226. Java and the Java Virtual 
Machine and other derivatives of Java are trademarks 
of Sun Microsystems, Inc. and are provided as part of 
Sun Microsystems Java Development Kit. 
[0023] A document entity 220, an element entity 222, 
and a content entity 224 designed in accordance with 
implementations of the present invention are also in- 
cluded in DK 219. These entities facilitate processing 
the storage of content of a document separately from 
processing associated with information reflecting the 
structure of content in the document. For example, con- 
tent entity 226 stores content such as text for a text doc- 
ument or alphanumeric characters in a spreadsheet 
while document entity 220 uses element entity 222 to 
represent the structure of the text (content) in the text 
document or layout of alphanumeric characters (con- 
tent) in a spreadsheet. As will be discussed below, these 
entities facilitate replacing storage methods in a soft- 
ware application to suit the application and computer 
system being used. 

Modular Storage Method in a Software Application 

[0024] FIG. 3 is a block diagram of entities implement- 
ed consistent with the present invention used to sepa- 
rate the process of storing content from the process of 
structuring content in document. These entities include 
an abstract document 302, a document entity 304, a 
content entity 306, an element entity 308, an element 



entity 310, and a content buffer 312. In one implemen- 
tation, these entities can be generated using an object- 
oriented programming language such as the Java pro- 
gramming language. Alternatively, these entities can be 

5 implemented using a procedural programming lan- 
guage such as C. These entities can also be implement- 
ed in firmware, hardware, application specific integrated 
circuits, and combinations thereof. 
[0025] In an object-oriented implementation, abstract 

10 document 302 specifies the method names and corre- 
sponding parameters for accessing both the structure 
of content in a document and the content of a document. 
In particular, document interface in abstract document 
302 defines the parameters used to access the structure 

'5 of a document but leaves implementing this method to 
a subclass such as document entity 304. Similarly, con- 
tent interface in abstract document 302 defines the pa- 
rameters used to access the content of the document 
and leaves implementing the method to a subclass such 

20 as content entity 306. Software applications can gain ac- 
cess to the structure of a document and the contents of 
a document using these two respective interfaces. 
[0026] Document entity 304 uses element entities 308 
and 310 to describe the structure of a document. In an 

25 object-oriented language, document entity 304 imple- 
ments the document interface in abstract document 
302. Based on information in document entity 304, ele- 
ment entities can be arranged in a hierarchy to reflect 
the structure of a document. For example, element en- 

30 tity 308 and element entity 31 0 can be used to describe 
a text document having two paragraphs. Element entity 
308 and element entity 310 may also be used to de- 
scribe the structure of a text document having two sen- 
tences or two other elements in a text document. 

35 [0027] The element entities include information used 
to access storage of content through content entity 306. 
Specifically, element entity 308 includes a start position, 
a stop position, and attributes associated with the text. 
The start and stop positions are used as indices into a 

40 storage area used by content entity 306. Attributes are 
used to describe details about formatting the content. 
For text content, formatting information may include at- 
tributes such as bold, underline, or italic. By identifying 
an index and not storing the actual content, storage 

45 methods associated with content entity 306 can be re- 
placed without significant modifications to document en- 
tity 304, element entity 308, element entity 310, or other 
portions of a software application. 
[0028] Content entity 306 provides access to content 

50 stored in a storage area such as content buffer 31 2. Op- 
erations used to manipulate the content in such a stor- 
age area include inserting string to add content, remov- 
ing string to remove content, and reading string to ac- 
cess content. In this example, operations associated 

55 with content entity 306 are specifically designed to work 
with the way information is stored in content buffer 31 2. 
[0029] Content buffer 31 2 is a storage area that stores 
content information as a series of alphanumeric charac- 
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ters. A start position and a stop position identify contents 
in content buffer 312. Each start and stop position cor- 
responds to an element associated with document entity 
304 such as element entity 308 or element entity 310. 
[0030] Content entity 306 updates start and stop po- s 
sitions as content is added and removed from content 
buffer 312. For example, if content is added between 
the start position and the stop position then the stop po- 
sition is advanced to indicate the increased content with- 
in an element. Because content entity 306 works with a 
particular storage area such as content buffer 312, re- 
placing content buffer 312 with a new type of storage 
area also requires a new content entity to manipulate 
the stored information. 

[0031 ] Methods of storing and organizing information 
in a sequence of storage locations such as content buff- 
er 31 2 include conventional string manipulation and gap 
storage methods. Additional storage methods may also 
include memory-mapped files, piece-table storage, re- 
mote database storage over a network, and embedded 
document storage in several documents. 

Replacing Storage Methods in a Software 
Application 

[0032] FIG. 4 is a flow chart of the operations used to 
replace a content entity for storing content consistent 
with one implementation of the present invention. Initial- 
ly, a software application is loaded into a computer sys- 
tem (step 402). The software application can be any ap- 
plication that processes text or other information ar- 
ranged according to a particular structure. The software 
application initially has a default content entity for man- 
aging the storage of text or other information. Loading 
the software application includes storing the application 
in memory or other storage area and preparing it for ex- 
ecution. A determination as to a modification the default 
content entity should be used with the particular soft- 
ware application executing on the computer system 
(step 404). For this purpose a comparison is made be- 
tween storage characteristics of numerous content en- 
tities against the available memory, processor capabili- 
ties, and disk storage associated with the computer sys- 
tem. For example, on a hand held computer system a 
content entity that provides remote storage and retrieval 
over a network may provide the appropriate storage so- 
lution. If the default content entity is not selected, a new 
content entity replaces the default content entity in the 
software application (step 406). As previously dis- 
cussed, the content entity in a software application can 
be replaced without replacing the document entity used 
to arrange the structure of text or information in a doc- 
ument. 

[0033] Next, the new content entity is associated with 
the existing document entity used to process the struc- 
ture of text used in the document (step 408). At this 
point, the new content entity may receive information 
stored in the old content entity and convert it into a for- 



mat compatible with the new content entity. This in- 
cludes copying information between two storage areas 
as well as updating the start and stop positions for each 
element entity used in the document entity Next, the 
software application is associated with both the docu- 
ment entity and content entity (step 410). This includes 
retrieving the structure of the document and retrieving 
the corresponding content associated with the structure. 
This information is then provided to the software appli- 
cation. For example, this may include retrieving the text 
information initially displayed on the computer screen of 
a text editor when the text editor begins editing a docu- 
ment. Next, the software application begins execution 
using the document entity to access the structure of con- 
tent in a document and the content entity to access the 
content (step 412). 

[0034] FIG . 5 is a flow chart of the operations for using 
software entities consistent with one implementation of 
the present invention to process the storage of content 
separately from the structure of content in a document. 
Initially, provision is made for a content entity capable 
of storing and manipulating the content associated with 
a document (step 502) and for a document entity that 
uses element entities to represent the structure of the 
content (step 504). For example, the document entity 
may create a hierarchy of element entities correspond- 
ing to a hierarchy of paragraphs, sentences, and char-, 
acters in a document. Next, the software application re- 
trieves information from the entity elements to deter- 
mine the location of content in the content entity (step 
506). For example, the software application may access 
an element entity to determine the start and stop posi- 
tions of a particular paragraph in a document. This in- 
formation retrieved from the element entities is then 
used to manipulate the content stored in the content en- 
tity (step 508). For example, the start position and stop 
position retrieved from an element entity may be used 
to access the content information stored in the content 
entity. 

Conclusion 

[0035] While specific implementations have been de- 
scribed herein for purposes of illustration, various mod- 
ifications may be made without departing from the spirit 
and scope of the invention. For example, discussions of 
applying this invention to software applications such as 
text editors or spreadsheets can also apply to other soft- 
ware applications that process information arranged in 
a structure. Those skilled in the art understand that the 
present invention can be implemented in a wide variety 
of software platforms and is not limited to the Java pro- 
gramming language and development environment. 
Furthermore, although aspects of the present invention 
are described as being stored in memory and other stor- 
age mediums, one skilled in the art will appreciate that 
these aspects can also be stored on or read from other 
types of computer-readable media, such as secondary 
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storage devices, like hard disks, floppy disks, or CD- 
ROM; a carrier wave from the Internet; or other forms of 
RAM or ROM. Accordingly, the invention is not limited 
to the above described embodiments, but instead is de- 
fined by the appended claims in light of their full scope 
of equivalents. 



Claims 

1. A method, executed on a data processor, of 
processing the contents of a document arranged in 
a structure, comprising: 

providing a content entity capable of storing 
and manipulating the content associated with 
the document without the structure of the con- 
tent, wherein the manipulating further compris- 
es, 

inserting content in the content entity iden- 
tified by a start position and a stop position, 
removing content from the content entity, 
or 

retrieving content from the content entity; 

providing a document entity that uses element 
entities to represent the structure of the content 
without the actual content, further comprising, 

arranging the element entities in a hierar- 
chy corresponding to the structure of the 
content in the document, and 
associating content in the content entity 
with element entities having a start position 
and a stop position; 

retrieving information from the element entities 
to determine the location of content in the con- 
tent entity; and 

manipulating the content stored in the storage 
area associated with the content entity accord- 
ing to the information retrieved from the ele- 
ment entities. 

2. A method, executed on a data processor, compris- 
ing: 

providing a first entity capable of storing and 
manipulating content of a document; 
providing a second entity reflecting a structure 
associated with the content of the document; 
retrieving information from the second entity 
identifying a location of particular content 
stored in the content entity; and 
manipulating the position of the particular con- 
tent according to the information retrieved from 
the second entity. 



3. The method of claim 2, wherein the manipulating 
further comprises, inserting the particular content in 
the content entity identified by a start position and 
a stop position. 

5 

4. The method of claim 2, wherein the manipulating 
further comprises, removing the particular content 
from the content entity. 

10 5. The method of claim 2, wherein the manipulating 
further comprises, retrieving the particular content 
from the content entity. 

6. The method of claim 2, wherein providing the doc- 
75 ument entity further comprises, 

arranging element entities in a hierarchy corre- 
sponding to the structure associated with the 
content of the document; and 
20 associating content in the content entity with at 

least one of the element entities having a start 
position and a stop position. 

7. A method, executed on a computer processor, for 
25 replacing a content entity used to store content in a 

document, comprising: 

loading a software application onto the compu- 
ter system wherein the software application has 
30 a default content entity for storing content in a 

document; 

determining whether the default content entity 
should be used by the software application 
when executing on the computer system; 
35 replacing the default content entity with a new 

content entity depending on the determination; 
and 

associating the software application with a doc- 
ument entity that provides the structure of the 
40 content and the new content entity that pro- 

vides the content associated with the docu- 
ment. 

8. The method of claim 7, wherein the determining f ur- 
45 ther comprises: 

identifying the storage capacity of the computer 
system; and 

comparing the storage capacity of the compu- 
50 ter system with the storage characteristics pro- 

vided by the default content entity and other 
content entities. 

9. An apparatus configured to process the contents of 
55 a document arranged in a structure, comprising: 

a processor; 

a memory having instructions executable on 
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the processor that provide a content entity ca- 
pable of storing and manipulating the content 
associated with the document, provide a docu- 
ment entity that uses element entities to repre- 
sent the structure of the content, retrieve infor- 5 
mation from the element entities to determine 
the location of content in the content entity, and 
manipulate the content stored in the storage ar- 
ea associated with the content entity according 
to the information retrieved from the element io 
entities. 

10. An apparatus configured to replace a content entity 
used to store content in a document, comprising: 

75 

a processor; 

a memory having instructions executable on 
the processor that loads a software application 
onto a computer system wherein the software 
application has a default content entity for stor- 20 
ing content in a document, determines if the de- 
fault content entity should be used by the soft- 
ware application when executing on the com- 
puter system, replaces the default content en- 
tity with a new content entity depending on the 2s 
determination, and associates the software ap- 
plication with a document entity that provides 
the structure of the content and the new content 
entity that provides the content associated with 
the document. 30 

11. An apparatus for processing the contents of a doc- 
ument arranged in a structure, comprising: 

means for providing a content entity capable of 35 
storing and manipulating the content associat- 
ed with the document; 

means for providing a document entity that us- 
es element entities to represent the structure of 
the content; 40 
means for retrieving information from the ele- 
ment entities to determine the location of con- 
tent in the content entity; and 
means for manipulating the content stored in 
the storage area associated with the content 45 
entity according to the information retrieved 
from the element entities. 

12. An apparatus for replacing a content entity used to 
store content in a document, comprising: so 

means for loading a software application onto 
a computer system wherein the software appli- 
cation has a default content entity for storing 
content in a document; ss 
means for determining if the default content en- 
tity should be used by the software application 
when executing on the computer system; 



means for replacing the default content entity 
with a new content entity depending on the de- 
termination; and 

means for associating the software application 
with a document entity that provides the struc- 
ture of the content and the new content entity 
that provides the content associated with the 
document. 
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